R Basics with Tabular Data
نویسندگان
چکیده
منابع مشابه
Statistical Disclosure Control for Tabular Data in R
To perform statistical disclosure control (SDC) on tabular data is a challenging task because we need to ensure that every suppressed cell of a table has a suffi cient width of a confi dentiality interval under the presence of linear relations among cell variables. However, we fi nd that the existing SDC tool (i.e., τ-ARGUS) does not effectively support an output checking process of the on-site...
متن کاملTabular Data Cleaning and Linked Data Generation with Grafterizer
Over the past several years the amount of published open data has increased significantly. The majority of this is tabular data, that requires powerful and flexible approaches for data cleaning and preparation in order to convert it into Linked Data. This paper introduces Grafterizer – a software framework developed to support data workers and data developers in the process of converting raw ta...
متن کاملSoftware for tabular data protection.
In order for national statistical offices to maintain the trust of the public to collect data and publish statistics of importance to society and decision-making, it is imperative that respondents (persons or establishments) be guaranteed privacy and confidentiality in return for providing requested confidential data. Consequently, for most survey and census data, disclosure limitation techniqu...
متن کاملExtending controlled tabular adjustment for non-additive tabular data with negative protection levels
Minimum distance controlled tabular adjustment (CTA) is a recent perturbative methodology for the protection of tabular data. An implementation of CTA was recently used by Eurostat for the protection of European Union level structural business and animal production statistics. The realworld instances to be solved forced the classical CTA model to be extended with two features: first, to deal wi...
متن کاملsupraHex: An R/Bioconductor package for tabular omics data analysis using a supra-hexagonal map☆
Biologists are increasingly confronted with the challenge of quickly understanding genome-wide biological data, which usually involve a large number of genomic coordinates (e.g. genes) but a much smaller number of samples. To meet the need for data of this shape, we present an open-source package called 'supraHex' for training, analysing and visualising omics data. This package devises a supra-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Programming Historian
سال: 2016
ISSN: 2397-2068
DOI: 10.46430/phen0056